CP-53335, topology: do not raise exception when loading invalid distance matrices (NUMA) #6249

psafont · 2025-01-24T15:19:49Z

Unreachable nodes do not contain any CPUs, and therefore VCPUs cannot be
scheduled on them. They are marked with a value of (2^32) - 1. Instead of
failing to produce a NUMA object that allows for scheduling, create an object
that contains only schedulable NUMA nodes. This means changing how the data
structures node_cpus and candidates are created so that they ignore the
unreachable nodes.
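A minimal sketch of the filtering idea, not the code in this PR: it assumes a node is unreachable when every entry in its row of the distance matrix is the sentinel, and the names `unreachable_distance`, `is_reachable` and `reachable_nodes` are hypothetical.

```ocaml
(* Sentinel for an unreachable node: (2^32) - 1, as stated in the description.
   The literal assumes a 64-bit OCaml int. *)
let unreachable_distance = 0xFFFF_FFFF

(* Hypothetical helper: a node counts as unreachable when every entry in its
   row of the distance matrix is the sentinel. *)
let is_reachable (row : int array) =
  not (Array.for_all (fun dist -> dist = unreachable_distance) row)

(* Indices of the nodes that VCPUs could be scheduled on. *)
let reachable_nodes (distances : int array array) : int list =
  distances
  |> Array.to_seqi
  |> Seq.filter (fun (_, row) -> is_reachable row)
  |> Seq.map fst
  |> List.of_seq
```

Structures such as node_cpus and candidates could then be built from the returned indices only.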

Also fixes two minor potential issues (distances being NaN, and duplicate entries being added to candidates, which increases running time), and splits the xenopsd unit tests into 3 binaries for ease of testing.
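To illustrate how a NaN distance can appear and be avoided, here is a hedged sketch. The helper `average_distance` is hypothetical and only assumes that distances are averaged somewhere; dividing `0.` by `0.` in OCaml yields `nan`, so the empty case is handled explicitly.

```ocaml
(* Hypothetical helper: mean of a list of integer distances.
   Returns None for an empty list instead of computing 0. /. 0., which is nan. *)
let average_distance (distances : int list) : float option =
  match distances with
  | [] ->
      None
  | _ ->
      let total = List.fold_left ( + ) 0 distances in
      Some (float_of_int total /. float_of_int (List.length distances))
```

Returning an option forces the caller to decide what an empty set of distances means, rather than letting a NaN propagate into later comparisons.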

Fixes #6240

lindig

Just some small comments; I did not look at the overall logic

contificate

I'm not overly familiar with NUMA, but the changes make sense to me (especially if they fix the cited issue).

Only minor nitpicks:

Unrelated to this change: CPUSet could just include Set.Make(Int).
The Array.for_all check of the row shadows d for a different semantic purpose.
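The first nitpick could look roughly like the sketch below. This is an assumption about the shape of the module, not the definition in topology.ml; `to_string` is a hypothetical extra added only to show that further functions can follow the `include`.

```ocaml
(* Sketch: reuse the standard integer set and extend it in place. *)
module CPUSet = struct
  include Set.Make (Int)

  (* Hypothetical extra: render the set as a comma-separated string. *)
  let to_string t = elements t |> List.map string_of_int |> String.concat ","
end
```

Because `Int.compare` is specialised to integers, the set does not rely on polymorphic comparison.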

edwintorok · 2025-02-03T15:19:03Z

ocaml/xenopsd/lib/topology.ml

-             let self_distance = d.(i).(i) in
-             (distance_to_candidate self_distance, Seq.return i)
-         )
-    in
    let numa_nodes = Array.length d in


We could count how many reachable NUMA nodes we have; that way unreachable nodes would not artificially trigger the 16-node limit.

We should probably also log when the limit is triggered, so we can debug unexpected cases.

Although that can come as a separate PR, this is already a good improvement.
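A rough sketch of that suggestion, under stated assumptions: a node is treated as unreachable when its whole row is the sentinel, the limit of 16 comes from the comment above, and `count_reachable`, `within_limit` and the `log` parameter are hypothetical names rather than anything in topology.ml.

```ocaml
(* Sentinel for unreachable nodes, (2^32) - 1; assumes a 64-bit OCaml int. *)
let unreachable_distance = 0xFFFF_FFFF

(* Limit mentioned in the review comment; the name is hypothetical. *)
let max_reachable_nodes = 16

let is_reachable row =
  not (Array.for_all (fun dist -> dist = unreachable_distance) row)

(* Count only the nodes that can take part in candidate subsets. *)
let count_reachable d =
  Array.fold_left (fun acc row -> if is_reachable row then acc + 1 else acc) 0 d

(* Returns true when the reachable nodes fit under the limit, and logs
   otherwise. [log] stands in for whatever logger the caller uses. *)
let within_limit ?(log = prerr_endline) d =
  let reachable = count_reachable d in
  if reachable > max_reachable_nodes then (
    log
      (Printf.sprintf "%d reachable NUMA nodes exceed the limit of %d"
         reachable max_reachable_nodes) ;
    false
  ) else
    true
```

With this shape, a host reporting 20 nodes of which only 2 are reachable stays under the limit, while 17 reachable nodes trip it and leave a log line behind.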

edwintorok · 2025-02-03T15:20:22Z

ocaml/xenopsd/lib/topology.ml

        valid_nodes
-        |> seq_all_subsets
-        |> Seq.filter_map (node_distances d)
-        |> seq_append single_nodes


I think the single nodes were always appended just in case something goes wrong with the filtering algorithm (which was meant to be somewhat smarter than brute force, or it may evolve in the future to be somewhat smarter).

As the algorithm currently stands, I agree that we don't need to append the single nodes here. Perhaps this would be a good condition to check in the test cases: that single (reachable) nodes are always present with the expected value (unless such a test already exists).
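The property could be checked along these lines. This is a sketch only: `subsets` is a plain brute-force stand-in written for the example, not the seq_all_subsets from topology.ml, and the node list is assumed to contain distinct reachable nodes.

```ocaml
(* Stand-in subset generator: every subset of a list, including the empty one. *)
let rec subsets = function
  | [] ->
      [[]]
  | x :: rest ->
      let without = subsets rest in
      List.map (fun s -> x :: s) without @ without

(* Candidate node sets: all non-empty subsets. *)
let candidates nodes = List.filter (fun s -> s <> []) (subsets nodes)

let count_occurrences x xs = List.length (List.filter (fun y -> y = x) xs)

(* Property: each single node appears among the candidates exactly once. *)
let singles_present_once nodes =
  let cs = candidates nodes in
  List.for_all (fun n -> count_occurrences [n] cs = 1) nodes
```

Appending the single nodes again after generating all subsets would make each of them appear twice, which is the duplication the change removes.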

edwintorok

Looks good, some suggestions on how the filtering could be extended to avoid artificially triggering the 16 NUMA node limit when not needed (although I don't think we currently have a system like that to test on, so this is purely theoretical, and something for the unit tests).

psafont · 2025-02-03T16:42:02Z

Rebased on top of latest master and made the limit in gen_candidates depend on the number of reachable nodes, ignoring the unreachable ones, as well as logging the situation.

stormi · 2025-02-03T17:04:42Z

Do you want another user test based on the latest iteration?

psafont · 2025-02-03T17:22:33Z

I don't think it's needed, thanks

psafont requested a review from edwintorok January 24, 2025 15:19

psafont force-pushed the private/paus/numaybe branch 2 times, most recently from d70811d to 8d287f5 Compare January 24, 2025 15:30

lindig approved these changes Jan 24, 2025

psafont force-pushed the private/paus/numaybe branch 2 times, most recently from d4642ff to 619f8ba Compare January 27, 2025 10:56

contificate reviewed Jan 27, 2025

contificate approved these changes Jan 27, 2025

psafont force-pushed the private/paus/numaybe branch from 619f8ba to 2735b54 Compare January 30, 2025 11:02

edwintorok reviewed Feb 3, 2025

edwintorok reviewed Feb 3, 2025

psafont force-pushed the private/paus/numaybe branch from 2735b54 to e2d7d75 Compare February 3, 2025 11:50

edwintorok reviewed Feb 3, 2025

edwintorok approved these changes Feb 3, 2025

psafont added 9 commits February 3, 2025 16:40

xenopsd tests: split suite into 3 executables

ecb099b

This allows testing independent modules faster and more easily. Signed-off-by: Pau Ruiz Safont <[email protected]>

CP-53335, topology: do not raise exception when loading invalid dista…

1a022d8

…nce matrices Instead, disable NUMA for the host. Fixes xapi-project#6240 Signed-off-by: Pau Ruiz Safont <[email protected]>

test_topology: reorganise test cases

ad74029

Now the tests use the actual data from the test specifications instead of being hardcoded, and the distance matrices used for testing are in their own module for better clarity. Signed-off-by: Pau Ruiz Safont <[email protected]>

CP-53335, topology: Avoid distances with NaN among NUMA nodes

75e4c31

These could be created accidentally by dividing by 0. Signed-off-by: Pau Ruiz Safont <[email protected]>

CP-53335, topology: Avoid duplicates in candidate NUMA nodes

fc1d96f

It's unclear why the candidates with single nodes were always added, since the algorithm that generates all the subsets already includes these. Signed-off-by: Pau Ruiz Safont <[email protected]>

topology: do not print-debug the host NUMA information

3b08dfa

It's already printed by xenopsd, and now that development has stabilised, unit-test can print this useful information. Signed-off-by: Pau Ruiz Safont <[email protected]>

topology: Use specialised compare for CPUSet

cba91b8

Signed-off-by: Pau Ruiz Safont <[email protected]>

topology: ignore unreachable nodes for upper limit of candidates' cal…

9badc14

…culation Now unreachable nodes are not considered when calculating all the subsets for the NUMA nodes combinations for scheduling a domain. Signed-off-by: Pau Ruiz Safont <[email protected]>

psafont force-pushed the private/paus/numaybe branch from e2d7d75 to 9badc14 Compare February 3, 2025 16:40

psafont added this pull request to the merge queue Feb 3, 2025

Merged via the queue into xapi-project:master with commit 3234be2 Feb 3, 2025
15 checks passed
